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ABSTRACT 

General relativity poses serious problems for counterfactual propositions peculiar to it as 
a physical theory. Because these problems arise solely from the dynamical nature of space- 
time geometry, they are shared by all schools of thought on how counterfactuals should 
be interpreted and understood. Given the role of counterfactuals in the characterization 
of, inter alia, many accounts of scientific laws, theory confirmation and causation, general 
relativity once again presents us with idiosyncratic puzzles any attempt to analyze and 
understand the nature of scientific knowledge must face. 
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1 Precis 


In his elegant, magisterial exposition of the foundations of general relativity, Malament (2012, ch. 2, §1, 
pp. 120-121) provides three interpretive principles to endow the mathematical framework of Lorentzian 
geometry with physical content:^ 

For all smooth curves 7 : / —>■ M [where / C K is an open interval and M is a candidate 
spacetime manifold]: 

(Cl) 7 is timelike iff 7 [/] could be the worldline of a point particle with positive mass; 

(C2) 7 can be reparametrized so as to be a null geodesic iff 7 [/] could be the trajectory 
of a light ray; 

(PI) 7 can be reparametrized so as to be a timelike geodesic iff 7 [/] could be the worldline 
of a free point particle with positive mass. 

(Emphases are Malament’s; ‘C’ indicates the proposition pertains to the interpretation of conformal 
structure, ‘P’ to projective structure; (Cl) articulates the physical meaning of timelike curves, (C2) 
that of null geodesics, and (C2) that of timelike geodesics.) He immediately offers four comments 
and qualifications to address possible concerns one may have with these propositions as interpretative 
principles, touching on questions about the exclusion of tachyons, the restriction to smooth curves, 
the status of point particles in general relativity, and, of most interest for our purposes, the modal 
character of the propositions. He concludes {ibid., p. 122), 

Though these four concerns are important and raise interesting questions about the role 
of idealization and modality in the formulation of physical theory, they have little to do 
with relativity theory as such. 

I agree with his conclusion in all parts, except for the concern about the role of modality. I think there 
are important problems with modality in general, and with the understanding of counterfactuals in 
particular, peculiar to general relativity as a physical theory, problems that have gone unremarked in 
the philosophy and the physics literature.^ Malament’s formulation and discussion of the interpretative 
principles allows them to be drawn out with great clarity.^ 

follow Malament (2012) in all relevant conventions (the signature of the spacetime metric, the definition of the 
Weyl tensor, etc.). The reader should consult that work or Wald (1984) for exposition of all concepts and results about 
general relativity I rely on in this paper, except where explicitly noted otherwise. 

^Strictly speaking, as the discussion in §3 will make clear, the problem is not restricted to general relativity, but 
rather infects all theories with spacetime structure that is dynamical by virtue of a particular kind of coupling with 
matter sources, such as /(ij)-gravity. (See, e.g., de Felice and Tsujikawa 2010 for an extended review.) I focus on general 
relativity as it our most strongly confirmed and most widely accepted theory of spacetime structure, and everything I 
say about it can be translated easily into the context of similar theories. The problem does not arise, however, simply 
by virtue of any non-trivial relation a theory may posit between geometry and matter: the problem does not arise, for 
instance, in classical Yang-Mills theory on Minkowski spacetime. 

emphasize that I am not criticizing Malament or trying to draw attention to weaknesses or errors in his exposition, 
quite the contrary. It is the exemplary (and characteristic) clarity, precision and thoroughness of his discussion that 
allows a previously unrecognized problem to be brought to light. 


2 



About (C2) he says {loc. cit), “We are considering what trajectories are available to light rays 
when no intervening material media are present—i.e., when we are dealing with light rays in vacuo.” 
Now, surely we want to talk as well about the null cones even at those places where matter is present. 
In order to do so, and in order to formulate the analogue of (C2) for those spacetime regions (in order 
to give a physical interpretation to the null cones at those points), we must say something along the 
following lines: the null geodesics where matter is present are those paths light rays would follow if the 
matter there were removed. But on its face, that modal statement makes no sense in the context of 
general relativity, because however we make sense of the idea of “removing matter” from a spacetime 
region, the metric will eo ipso be different in that region from what it was, and it will generically be 
the case that the new metric in that region will not agree with the original metric on what it counts 
as null vectors, much less on what it counts as null geodesics, among many other differences.^ The 
distribution of matter in a region of spacetime in large part informs the metrical structure there, so 
what sense can be made, in the context of the theory, in asking what the metrical structure would be 
if the matter actually there were not there? And now we face the heart of the problem: the lack of a 
unique vacuum solution for the Einstein field equation forces an ineliminable ambiguity in the idea of 
what it may mean to “remove matter from a region of spacetime”, guaranteeing that we have no way 
to conclude on any principled basis “what the metric would then look like there”. 

2 Counterfactuals in Physical Theories 

One of the simplest and prima facie most promising ways to begin to try to get a grip on the problem, 
and to look for its resolution, is to treat the proposed counterfactual changes as represented by a change 
in initial conditions, and so to use the machinery of general relativity’s initial-value formulation to 
fix the solution. Indeed, this is the natural and effective way of dealing with such counterfactuals 
in all other physical theories. When one wants to know how the state of a system would change if 
some of its properties or some properties of its environment were to change, one first writes down the 
initial-value formulation (or boundary-value problem, depending on the details of the theory and the 
problem at hand), plugs in the original values for all parameters, and calculates the solution; one then 
does the same thing using the new, counterfactually changed values for the parameters, and compares 
the new solution to the old. Easy as pie, and about as philosophically unproblematic as things can 
possibly get for a large class of fundamentally important counterfactuals. 

What makes this procedure work in almost all physical theories is the context of a family of fixed 
background structures, often spatiotemporal ones, with respect to which one has natural ways of 
identifying and so comparing entities like “the same quantity of the same system at the same place 
and same time, under otherwise different conditions or in otherwise different states”. To see how this 

^Indeed, in regions of spacetime filled, say, with a non-trivial dielectric, there simply are no null curves that are “the 
possible paths of light rays” because the full resources of electromagnetic theory tell us in this case that, under any 
reasonable construal of “propagate”, light does not propagate through a dielectric with the speed of light in vacuo. 

The same problem arises for timelike curves in regions of spacetime already occupied by matter, i.e., for (Cl) and 
(PI), but I focus on the case of null rays to simplify the exposition. 


3 



works in a simple case, one that has especial relevance to the problem in general relativity, consider 
the situation in Newtonian gravitational theory. It makes perfect sense in Newtonian theory to reason 
counterfactually about the behavior of a given kind of system in the presence or absence of any other 
kind of system, since that presence or absence won’t affect the kinematical structure of Newtonian 
spacetime: it is always with a fixed foliation of simultaneity slices, and a fixed, flat affine structure 
defining the possible trajectories of all freely falling bodies (Stein 1967).® There is no problem in 
principle in computing the counterfactual change in gravitational forces in a region induced by any 
counterfactual changes in the distribution of matter anywhere in the spacetime. For example, one 
may be interested in the question: what would happen to the orbits of the planets in the Solar System 
if the sun were to vanish? Nothing simpler. Plug in the new values for the sun’s size and mass 
{viz., zero), compute the new orbits, and compare them to the original ones by using the background 
simultaneity and affine structures as referential framework. 

The virtues of this method for the representation, interpretation and evaluation of counterfactuals 
in the context of physical theory are legion. For our purposes, perhaps the greatest virtue is this: 
pragmatics plays absolutely no role. It does not matter what the investigator’s purposes are, or the 
exact experimental techniques one envisions as being employed, or anything of the sort. There is no 
need for the ad hoc fixing of comparison classes of systems, or for the ad hoc fixing of methods for 
identifying “the same quantity of the same system at the same place and same time, under otherwise 
different conditions or in otherwise different states”. It’s all fixed naturally and canonically from the 
start.® 

Famously in general relativity, there is no fixed, absolute, background structure one could use as 
such a referential framework for the comparison of properties of different solutions to the Einstein 
field equation. Thus, prima facie, there is no way to employ the standard machinery of the initial- 
value formulation (or the boundary-value problem) to represent, render meaning to and evaluate such 
counterfactuals in general relativity. 

3 The Problem in Detail 

To begin to come to grips with the problem in more detail, let us consider the problem of the sun’s van¬ 
ishing translated from the Newtonian context into that of general relativity: what would Schwarzschild 
spacetime look like if its central mass were removed? Perhaps the immediate, intuitive (and naive) 
response is to say: obviously the result will be Minkowski spacetime. The plethora of available vacuum 

^Nothing of importance for the example would change if we were to work in the more sophisticated context of 
geometrized Newtonian gravity (Malament 2012). I suspect, however, that in generalizations of geometrized Newtonian 
gravity, in which one does not restrict the topology of spacetime to and one does not (effectively) require that the 
simultaneity slices be spatially flat (Malament 2012), then similar problems would arise. They would, however, arise for 
the same reasons as they do in general relativity: the absence of fixed, absolute, background spatiotemporal structure 
because of dynamical coupling with matter (footnote 2). 

®This is not to say there no choices at all to be made in this context. One might decide in the Newtonian example, 
for instance, to consider only smooth solutions, or to allow continuous or even distributional solutions, in the fixing of 
the comparison class. This choice, however, does not affect the naturalness and canonicity of the procedure. 
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solutions, however, shows the choice is not so simple. 

To see the issues involved more clearly, let us try to articulate the problem in a more explicit, 
precise and rigorous fashion, using the idea of taking the limit of a continuously varying family 
of spacetimes. We start with Schwarzschild spacetime of a given mass, and consider a family of 
Schwarzschild spacetimes parametrized by mass continuously shrinking to zero. It may be initially 
surprising to learn that such a limiting family has no unique limit. (This follows immediately from 
the fact that any reasonable topology on a non-trivial family of spacetimes is non-Hausdorff; see, 
e.g., Curiel 2014.) Working the example out in a little detail shows clearly what is going on.^ In 
Schwarzschild coordinates, using the parameter A = M~^l^ (the inverse-third root of the Schwarzschild 
mass), the metric takes the form 

" i) " i) (1) 

This clearly has no well defined limit as A —>■ 0. Now, apply the coordinate transformation 

f = Ar, i= p = \~^9 

In these coordinates, the metric takes the form 

^A^ — d? — ^A^ — siv? [Xp] d(j)^) 

The limit A —>■ 0 exists and yields 

— vdP -I- ^df^ — f‘^{dp^ + fP'dfj)^) 
r 2 

a flat solution discovered by Kasner (1921). If instead of that coordinate transformation we apply the 
following to the original Schwarzschild form (1), 

X = r + X~‘^, p = X~^6 

then the resulting form also has a well defined limit, which is the Minkowski metric. The two limiting 
processes yield different spacetimes because it happens behind the scenes that “the same points of 

the underlying manifold get pushed around relative to each other in different ways”. Because the 

coordinate relations of initially nearby points differ in different coordinate systems, those differences 
get magnified in the limit, so that their final metrical relations differ. Thus, the limits in the different 
coordinates yield different metrics, with no natural or preferred way to say which is the “correct” 
limit—which, if either, is the correct spacetime that results from counterfactually removing the sun 
from the Solar System. 

The root of the problem lies in the fact that metrical curvature is only in part informed by the 
distribution of matter: the Weyl curvature at a point, exactly that part of the curvature encoding 
conformal information, such as what counts as a null vector, is independent of the value of the stress- 
energy tensor at that point—the value of the Weyl tensor, point by point, is not constrained by the 
^The following analysis is taken from Geroch (1969). 
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presence or absence of matter. In regions without matter, moreover, metrical curvature is governed 
entirely by the Weyl tensor. Still, the Weyl tensor C°'bcd is subtlely related to the distribution of 
matter at neighboring points, when there is such matter, in a way that can be made precise by using 
the Bianchi identity formulated using the so-called Lanczos tensor.® Thus, in “removing matter” from 
a spacetime region, there can be no principled way to determine what the “remaining curvature” will 
be. One may decide to keep the Weyl tensor the same. But precisely its relation to stress-energy by 
way of the Lanczos tensor means that this is not an unproblematic way to proceed, and is likely even 
incoherent or inconsistent.® One may sum all these issues up by adverting to the fact that there is 
not a unique vacuum solution to the Einstein field equation: the form of the Einstein field equation 
for a vacuum region remains the same no matter what the field equations are for matter immediately 
outside the vacuum region, but those exterior matter fields do not suffice to fix the solution for the 
vacuum region.^® 

Now, the problem with this example is, in fact, even more acute than the discussion so far shows: 
there are an innumerable number of possible spacetimes, not all of them flat, with many different 
topologies possible, that one can derive by rigorous limiting processes from Schwarzschild spacetime 
as the central mass goes to zero.^^ The same problems already discussed arise for all of these, 
compounded by the great number and variety of possible topologies the limits may respectively take. 
Let us say, however, that we have, in some way or other, decided that we are interested in the limit that 
takes Schwarzschild spacetime to Minkowski spacetime, and in particular we are interested in the ways 
that geodesics near the event horizon will change over the course of the limit. The natural topology of 
the manifold of Schwarzschild spacetime is K® x S®. The natural topology of Minkowski spacetimes, 
however, is Thus, in an intuitive sense spacetime points will “disappear” in the limiting process, 
because one must “de-compactify two topological dimensions” to derive from K® x S®. There are 
many ways to effect such a de-compactification; all the simplest, such as that based on Alexandrov 
compactification, work by the removal of a point or set of points from the topological manifold.^® 
How is one to identify which spacetime points in the limit manifold are those that were “close to 

®The Lanczos tensor is defined as follows: 


'^abc -“t 


= 47rV[i,T,j]^ - —SclfcVajT 


( 2 ) 


The Bianchi identity may then be rewritten 


^ nC abc — Jahc 


Thus the value of the Weyl tensor at a point does depend in an indirect way on the distribution of matter at nearby 
points. 

®It should therefore be clear that these sorts of problem arise not only for counterfactuals involving changes in the 
distribution of matter, but also for any involving changes in the curvature more generally. One may, for example, try 
to consider how the behavior of test-particles in a vacuum spacetime would change if one were to “alter a component 
of the ambient gravitational radiation”. 

^®It was, as a matter of fact, exactly this issue that first led Einstein to formulate his infamous Hole Argument (Curiel 
2015). 

'^^See, e.g., Paiva, Rebougas, and MacCallum (1993) for explicit construction of a few different ones. 

^^See, e.g., Kelley (1955) for an account of methods of compactification, including the Alexandrov type. 
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the event horizon” in the original Schwarzschild spacetime manifold x S^? 

The problems discussed here with the representation, interpretation and evaluation of counter- 
factuals in general relativity are not restricted to the case of removing or otherwise altering matter 
distributions. Similarly problematic modalities, requiring the fixation of a relevant comparison class of 
spacetimes and the identification and comparison of structure and properties across spacetime models, 
arise all over the place in general relativity. I offer a brief sample. 

1. the proofs of all the Laws of black-hole mechanics (Wald 1994) 

2. the proof of the No-Hair theorem for black holes (Heusler 1996) 

3. proofs that various causality conditions imply each other, e.g., that the absence of closed timelike 
curves implies the absence of closed causal curves (Hawking and Ellis 1973) 

4. proofs of geodesic theorems (Geroch and Jang 1975; Ehlers and Geroch 2004) 

5. well-posedness (in the sense of Hadamard) of the initial-value formulation (Wald 1984) 

6 . formulation of and arguments for the Cosmic Censorship Hypothesis (Joshi 2003) 

7. formulation of and arguments for Penrose’s Conformal Curvature Hypothesis (Penrose 1979) 

8 . proof of the Topological Censorship Theorem (Friedman, Schleich, and Witt 1983) 

9. the possible extendibility of spacetimes past singularities, and the possible hole-freeness of a 
spacetime in general (Manchak 2014) 

10 . taking “small” perturbations oft a fixed spacetime, e.g., in considering slight inhomogeneities in 
the cosmological FLRW models (Szekeres 1975a), or in treating slightly aspherical gravitational 
collapse (Szekeres 1975b) or more generally distorted black holes (Geroch and Hartle 1982) 

11 . the genericity or scarcity (or stability or rigidity) of a given spacetime property, e.g., the existence 
of singularities or of large-scale cosmological structure or of a given range of values for a constant 
of nature (Curiel 2014) 

4 Severity of the Problem 

There is no single algorithm or reasoning procedure to employ for all such possible cases of counter- 
factual (and more general modal) reasoning in general relativity, and certainly no natural or canonical 
one. One must handle such situations on a case-by-case basis, coming up with a method to fix the 
comparison class and similarity structure in the proposed counterfactual situation in some way that 
respects the particular constraints of the project the counterfactual reasoning is to play a part in, 
while at the same time making possible the identification and comparison of structure and properties 
across spacetimes in the comparison class: the intervention of pragmatics is inescapable, and will 
perforce depend on ad hoc constructions and arguments. 

The astute reader, however, may now immediately want to respond that in many interesting 
accounts of counterfactuals in the literature, there are formal devices whose sole purpose is to manage 
the pragmatics of their interpretation and evaluation. For example, this is exactly what the freedom 


7 


in the selection of “similarity” measures in a Lewisian or Stalnakerian account of counterfactuals is 
supposed to accommodate. But the problem is deeper here in general relativity. It is not simply 
that the choice of a similarity measure is a matter of pragmatics. It is rather that “the choice” in 
general is not a well defined set of alternatives at all. Even once one has decided in some way or 
other what the appropriate comparison class of spacetimes ought to be, and what is to count as 
the relevant measure of similarity among the spacetimes, still one simply does not know how, in a 
general, abstract way, to make sense of the idea “different values or forms of the same structure in 
different spacetime models” (or, in Lewisian terms, in different possible worlds in a family of mutually 
accessible ones). Even putting aside the problems posed by differing topologies in spacetimes one 
may want otherwise to judge similar in some important fashion (as discussed in §3), the diffeomorphic 
freedom inherent in general relativity makes it in general impossible to say “what is the same point 
in different spacetimes”. There is no principled reason to use x S^, say, as the diffeomorphic 
presentation of Schwarzschild spacetime rather than with a line removed (which is diffeomorphic 
to X , but not canonically so)—but how is one to identify in a principled way a point in x 
with one in with a line removed? Thus, not only does general relativity block the standard methods 
of managing the pragmatics of counterfactual interpretation and evaluation, it blocks the prior, even 
more fundamental step: the representation of the counterfactual in the given language (in this case, 
the mathematical theory of Lorentz manifolds). 

Still, one may want to ask, if this is such a problem, how do physicists handle it in practice? For 
they surely do—otherwise no progress would ever have been made in any of the problems listed at 
the end of §3. The answer is that, when engaging in arguments requiring such counterfactual and 
more generally modal reasoning, physicists do use ad hoc methods for the fixation of an appropriate 
comparison class of spacetimes and similarity measure on it. More importantly, they use ad hoc 
methods for the identification and comparison of structure and properties across the spacetimes in 
the fixed class, e.g., by requiring that the spacetimes in the family have a hxed topology, or have a 
fixed diffeomorphic presentation, or satisfy certain symmetries, and then fixing a coordinate system 
for the identification of points across spacetimes. Though those methods are ad hoc in the strong (and 
etymologically literal) sense in that they are applicable only to the specific context in which they are 
used, and cannot be generalized in any way to a method applicable in all situations across the theory 
(much less generalized in a canonical or natural way), there are often strong physical justifications for 
their use.^^ This surely suffices for all the purposes the practical physicist has in his or her theoretical 
work, and only a fool or a philosopher would cavil at it. For we philosophers, however, who are 
interested in the understanding and comprehension of the foundations of the theory as a whole, the 
problem of the interpretation and evaluation of counterfactuals in the context of the theory cannot be 

^®Even in many of these cases, however, the method used is (figuratively) ad hoc in the stronger sense that there is 
no known physically significant justification for it. This happens most often in those problems in which the analysis 
requires mathematical machinery of an even more advanced and heavy-duty sort than one needs in workaday general 
relativity. For instance, I know of no convincing physical interpretation for the Sobolev norms required to prove the 
well-posedness of general relativity’s initial-value formulation (Wald 1984). Physicists use them because they work 
mathematically, no other reason. Indeed, I have never even seen a physicist pay lip-service to the idea that the Sobolev 
norms have real physical significance. 



considered settled, or even as having a promising attack made upon it, until reasonable and plausible 
methods applicable to the theory as a whole are constructed. 

The problem I expose in this paper is severe: many influential philosophical approaches to many 
fundamental problems and issues in the philosophy of science—the meaning of fundamental inter¬ 
pretive principles, the nature of scientific laws, the stability of theory-confirmation, the nature of 
causation, et al. — rely in ineliminable ways on subjunctive conditionals for their formulation, anal¬ 
ysis and application. Physicists certainly rely on such propositions in theoretical and experimental 
practice all the time, e.g., to propose and perform tests of general relativity. What reason do we have 
to believe that we understand what is happening in such cases in the context of general relativity, 
much less to have confidence in any conclusions drawn? Because the problem arises solely from the 
dynamical nature of spacetime geometry in general relativity, moreover, what I say here is indepen¬ 
dent of one’s favorite account of counterfactuals—it depends only on the theoretical resources general 
relativity provides to model such situations and pose such propositions, no matter what ancillary tools 
or frameworks one uses to interpret and understand them. 

I wanted in this paper only to draw attention to this serious problem, not to propose possible 
solutions. I think any decent attempt to do the latter will require a great deal of involved, technical 
work, including detailed examination of many non-trivial examples. I sincerely hope someone takes 
up the challenge. I am not entirely without hope for progress to be made here. Indeed, it seems 
likely that progress on this problem could suggest new avenues of attack on the traditional problem 
of understanding counterfactuals in general, and could suggest elaborations of and improvements to 
already existing accounts. In any event, beyond the possible benefits that progress on this problem 
may have for the general problem of counterfactuals, the mettle of philosophy and the needs of physics 
demand we understand what is going on here. 
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